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Abstract 

The Sloan Digital Sky Survey (SDSS) is collecting photometry and intermediate resolution 
spectra for ~ 10^ stars in the thick-disk and stellar halo of the Milky Way. This massive 
dataset can be used to infer the properties of the stars that make up these structures, and 
considerably deepen our vision of the old components of the Galaxy. We devise tools for 
automatic analysis of the SDSS photometric and spectroscopic data based on plane-parallel 
line-blanketed LTE model atmospheres and fast optimization algorithms. A preliminary 
study of about 5000 stars in the Early Data Release gives a hint of the vast amount of infor- 
mation that the SDSS stellar sample contains. 

1.1 Introduction 

The Sloan Digital Sky Survey is an ambitious project that is imaging about one 
fourth of the sky with five broad-band filters. The survey includes followup intermediate- 
dispersion (A/(5A ~ 1800) spectroscopy (York et al. 2000). The final catalog is expected to 
include photometry and spectroscopy for about 10^ and 10^ sources, respectively. Focused 
on extragalactic science, the spectroscopic survey aims at amassing the largest possible col- 
lection of galaxy redshifts. The dedicated f/5 Ritchey-Chretien-like 2.5m telescope has a 
three-degree field of view. In the spectroscopic mode, up to 640 (180/im or 3 arcsec 0) 
fibers can be simultaneously positioned on the focal plane to feed two identical spectro- 
graphs. Each spectrograph has a blue and a red arm that provide continuous coverage in the 
range 381-910 nm. 

The selection criteria for the spectroscopic targets are rather complex (Eisenstein et al. 
2001; Richards et al. 2002; Stoughton et al. 2002; Strauss et al. 2002). Galaxies and 
quasar candidates take about 90% of the fibers, with the remaining used to observe the sky 
background and Galactic stars, which are either selected for being peculiar (brown dwarfs, 
blue-horizontal branch stars, carbon stars, etc.), or intended for reddening and flux calibra- 
tion. In addition, almost a third of the quasar candidates in the Early Data Release turned out 
to be stars. Nearly ^ 10^ stellar spectra will be released by the end of the survey in 2006. 
With exposure times per plate of the order of 45 minutes, the targeted stars have V magni- 
tudes in the range 14-21, signal-to-noise ratios (S/N) between 5 and 150, and lie at distances 
of up to hundreds of kiloparsecs from the Galactic plane. When released, the SDSS stellar 
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spectra will constitute the largest spectroscopic survey of the Galactic thick-disk and halo 
populations yet assembled. 

1.2 Analysis 

The analysis of a massive dataset calls for automated procedures. The SDSS images 
and spectra are processed by a series of pipelines specialized in tasks such as astrometric, 
wavelength, and flux calibrations, aperture photometry, or the extraction and classification 
of spectra. Starting from the released photometry and spectra for Galactic stars, we are in- 
terested in a detailed classification based on the fundamental atmospheric parameters. Pro- 
jected radial velocities are directly measurable from the spectra. Given the large spectral 
coverage, we expect to be able to quantify the interstellar reddening towards the observed 
stars. Stellar metalhcities, and even perhaps the abundance ratios of chemical elements or 
groups of elements well-represented in the spectra, are obviously some of the most valuable 
information to search for. It is also most interesting to use the derived stellar parameters, 
chemical abundances, and interstellar reddening to infer distances and ages. 

We make use of the SDSS (ugriz) photometry and the ~ 3800 pixels in an object's spec- 
trum altogether. We found it helpful to trade resolution for S/N, and therefore the spectra are 
smoothed to A/5 A = 1000 by convolution with a Gaussian profile. As absolute fluxes are not 
relevant at this point, we use photometric indices and normalize the spectra {5,} to satisfy 
X^^j^i/m = 0.5. The relevant data vector is T= {u-g,g-r,r-i,i-z,Si,S2,S3,... ,S„}, 
where m = 2600. We model T with plane-parallel line-blanketed LTE model atmospheres 
and radiative transfer calculations, as a function of the stellar parameters (effective tem- 
perature Teff, surface gravity g, and overall metallicity [Fe/H]*). The collection of model 
atmospheres and low-resolution synthetic spectra of Kurucz (1993) is used. This grid was 
calculated with a mixing-length I /Hp = 1.25, and a micro-turbulence of 2 km s"'. The low- 
dispersion spectra are convolved with the SDSS filter responses (Strauss & Gunn 2001). The 
atmospheric structures are used to produce LTE synthetic spectra with a resolving power 
X/SX= 1000 between 381 and 910 nm. Balmer line profiles are treated as in Hubeny, Hum- 
mer, & Lanz (1994). The radiative transfer equation is solved with the code synspec 
(Hubeny & Lanz 2000), using very simple continuous opacities: H, H~, Rayleigh and elec- 
tron scattering (with the prescriptions in Hubeny 1988). The calculations included 131821 
atomic line transitions, but no molecular features. 

Both photometric magnitudes and spectra are computed for a discreet 12 x 4 x 6 grid 
spanning the ranges 4500 to 10000 K, 2.0 to 5.0 dex, and -4.5 to +0.5 dex in T^ff, logg (c.g.s 
units), and [Fe/H], respectively. The interstellar reddening (E(B-V)) is parameterized as 
in Fitzpatrick (1999), adopting R = A{V)/E{B-V) = 3.1. This parameter gives one more 
dimension to the grid. We consider E(B-V) in the range 0.0-0.1 with just three values. 
Model spectra and photometry for sets of parameters off the grid nodes are derived by multi- 
linear interpolation. 

Some elements show abundance ratios to iron that are non-solar in metal-poor stars. This 
is largely ignored in our modeUng. However, we consider enhancements to the abundances 
of Mg and Ca in metal-poor stars when calculating synthetic spectra because these elements 
produce strong lines on which our analysis heavily relies. FoUowing Beers et al. (1999) we 
adopt [a/Fe] ~ [Ca/Fe] ~ [Mg/Fe]: 



* [E/H] =log -log I ^Tji J , where iV represents number density of a chemical element. 
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Fig. 1.1. Comparison of observed spectra (black line) and photometric indices 
(open circles) with the best-fitting model (red line and green circles) for one of 
the EDR stars. The inset shows an enlarge view of the bluest part of the spectrum. 
The primes in the photometric magnitudes are related to technical subtleties be- 
tween different calibrations. For our purposes these are the same as the non-primed 
magnitudes we refer to in the text. 



if > [Fe/H] 

[a/Fe] = { -0.267 [Fe/H] if-1.5 < [Fe/H] < (1.1) 
+0.4 if [Fe/H] <- 1.5. 

We perform a search for the model parameters that minimize the distance between the 
model T {Tf^ff,g, [Fe/H], E(B-V)) and the observations' vector O. In a fashion, we define 
such distance as 

MJ+4 

f, = Y^WiiOi-T,f (1.2) 
/=i 

where the weights W,- are optimized for performance. The search is accomplished by using 
either the Nelder-Mead simplex method (Nelder & Mead 1965) or a genetic algorithm (Car- 
roll 1999). The multi-linear interpolation was deemed as accurate enough after repeating 
the interpolations with the decimal logarithm of the fluxes. Testing with finer grids in \ogg, 
[Fe/H], and E(B-V), led only to marginal variations in the results. Classification of a single 
spectrum takes a few seconds on a 600 MHz workstation. 

Fig. Il.ll shows an example of observed (black) and model (red and green) fluxes for one of 
the EDR stars that we fit with T^ff = 5388 K, \ogg = 4.71, [Fe/H]=-1 .2, and E{B-V) = 0.004. 
The inset plot gives an expanded view of the blue part of the spectrum. Note that molecular 
features, such as the G band (CH) at ~ 4300 A are not reproduced by the model spectrum. 



3 



Allende Prieto et al. 



The photometric indices have been scaled to fit in the graph's box and placed at arbitrary 
wavelengths. 

Once the atmospheric parameters are defined, we make use of stellar evolutionary calcu- 
lations by the Padova group (Alongi et al. 1993; Bressan et al. 1993; Fagotto et al. 1994; 
Bertelli et al. 1994) to find the best estimates for other stellar parameters: radius. My, mass 
(M), Age, etc. With the atmospheric parameters and their uncertainties in hand we define 
a normalized probability density distribution that is Normal for Teff and \ogg, and a boxcar 
function in log(Z/ZQ) 



P (X exp 



eff 



\V2a(T^s) 



exp 



logg-logg* 
V2a(logg) 



B(log(Z/ZQ)), (1.3) 



which is then used to find the best estimate of a stellar parameter X by integration over the 
space iZ/H, Age, and initial mass M) that characterizes the stellar isochrones of Bertelh et 
al. (1994) 

X= j I I XP(Z/H,Age,M)d(Z/ZQ)d(Age)dM. (1.4) 

Jz/H J Age Jm 

The isochrones employed do not consider enhancements in the abundances of the a el- 
ements for metal-poor stars. Thus, we simply equate [Fe/H] = log(Z/Zo). More realistic 
relations should take this into account, and will be explored in the future. 

Finally, the magnitudes in the SDSS passbands can be used to estimate the Johnson V 
magnitudes of the stars (Zhao & Newberg 2002). Knowing My and the reddening, it is then 
straightforward to derive distances. 

1.3 Checking on T^s and [Fe/H] 

The inclusion of the Ca K line and the seven first members of the Balmer series in 
the EDR spectra makes them suitable for the application of well-tested techniques developed 
for the followup of stars in the HK survey (Beers, Preston & Shectman 1992; Beers et al. 
1990; Beers et al. 1999). 

After measuring the pseudo-equivalent widths of the relevant spectral features and esti- 
mating the (B-V) colors from the SDSS photometry, we obtain a second, relatively indepen- 
dent, measure of the metallicities of the EDR stars. After visual inspection, a subsample of 
1910 stars was deemed of reasonably good quality, and their metallicities are compared with 
those determined by spectral fitting in the upper panel of Fig. 11.21 The overall agreement 
for [Fe/H] > -2.5 is reasonable, with a scatter of about 0.5 dex. A systematic discrepancy is 
apparent for the most metal-poor stars. This disagreement exceeds the internal uncertainties 
of each method and should be investigated. We have found that, in the presence of severe 
noise, the optimization algorithms tend to underestimate the metallicity. We should also note 
that a spurious feature is apparent in many EDR spectra right between the Ca H and K lines, 
which might be affecting the Ca 11 K method. 

The (B-V) colors estimated from the SDSS ugriz photometry and the stellar metallicities 
were fed to the photometric calibrations of Alonso et al. (1996, 1999), which are based on 
the Infrared Flux Method (IRFM; Blackwell, Shallis & Selby 1979). These calibrations have 
an internal scatter of about 2% and an uncertainty in the zero point of about 1%. Therefore, 
they offer a reliable external check to the TeffS determined automatically from the simulta- 
neous analysis of EDR spectra and photometry. The interstellar reddening was corrected 
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Fig. 1.2. Top panel: Comparison between the [Fe/H] derived by our spectral fitting 
technique and those with the K method of Beers et al. (1999). Lower panel: The 
effective temperatures from fitting the spectra are compared to those based on the 
photometry and the IRFM calibrations of Alonso et al. (1996, 1999). The dashed 
Unes have a slope of one. 

using the values determined spectroscopically. The lower panel of Fig. ll.2l shows a pleasing 
correspondence between the two Teff determinations. The IRFM TeffS are, on average, lower 
by 66 K with an rms scatter between the two scales of 160 K. 

1.4 Application to the EDR. Preliminary results. 

The SDSS begun standard operations in April 2000. The Early Data Release (EDR; 
Stoughton et al. 2002) was made public on June 5, 2001. It consists of 462 square degrees 
of imaging data and 54008 spectra. The data were acquired in three regions, two of them 
following the celestial equator in the southern and northern Galactic skies, and a third which 
overlaps with the SIRTF First Look Survey (Storrie-Lombardi et al. 2001). We have selected 
the spectra that were finally identified as stellar. Our sample includes 5604 objects, but we 
were only able to identify Balmer lines in 4714, whose distribution in Galactic coordinates 
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Fig. 1.3. Location of the EDR stars in galactic coordinates for each hemisphere. 
The galactic latitude corresponds to the radius in the plot and the longitude to the 
azimuth. 
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Fig. 1.4. Distribution of brightness for the EDR stars. V magnitudes were not di- 
rectly measured, but inferred from the SDSS ugriz fluxes following Zhao & New- 
berg (2002). 



is shown in Fig. 11.31 The brightness distribution of the sample ranges between V = 14 and 
21, and it is depicted in Fig. 1 1.41 

It is interesting to compare the distances found for the sample as a function of the stellar 
Teff. In Fig. 11.51 dwarf stars define a tilted band that marks the minimum distance at a 
given Teff. The K-type dwarfs in the sample are located at distances of 1-2 Kpc. Warmer 
stars on the main sequence are more distant, with an obvious drop in density at Teff ~ 6500 
K. Some areas in Fig. 1 1.51 are underpopulated. Subgiants cause the overdensity in a band 
nearly perpendicular to the main-sequence that crosses it at about Teff = 7400 K. As stars 
evolve off the main-sequence, they become cooler but more luminous, and can be seen at 
larger distances. A second band parallel to the first is weakly apparent intersecting the main- 
sequence at a Teff of ^ 6000-6300 K. Interpreting these features as 'turn-offs', this diagram 
marks two preferred ages for the sample. 
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Fig. 1.5. Position of the EDR stars in the T^ff- distance plane. 
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Fig. 1.6. Position of the EDR stars in the [Fe/H] - distance plane. 



The limiting magnitude for the sample of EDR stellar spectra is V ~ 20-21. G-type giant 
stars with My ~ 1 allow us to reach out to distances as far as ~ 50 Kpc, but supergiants with 
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Fig. 1.7. The graph shows which stars cover which range of distances. Two com- 
ponents have been fitted by least-squares to the data. These can be associated with 
the halo (blue), and thick disk (red; but also contaminated with halo stars). 



My ~ -4 extend our scope about ten times farther The sharp drop in density of stars for 
distances larger than about 200 Kpc may or may not be a selection effect. 

Another interesting projection of the data is on the plane [Fe/H] vs. distance (see Fig. 
II. 6> . The most metal-poor stars ([Fe/H]< -3) appear only at distances larger than 3 Kpc. It 
is apparent in the Figure that many stars clump at small distances (< 4 Kpc) in the metallicity 
range -1 .2 <[Fe/H] < -0.4. It is very tempting to identify this population with the thick disk 
(see, e.g., Gilmore & Reid 1983; Reddy et al. 2003). The concentration of stars at exactly 
the grid nodes in [Fe/H] (-3.5,-2.5, . . .) is an artifact of the search algorithm that we do not 
understand yet. 

Fig. II. 71 shows the range of distances that each luminosity class covers. In this Figure, 
the thick disk and halo populations are clearly separated. The thick disk stars show a density 
distribution that rapidly falls beyond 2 Kpc. The halo star counts, however, decline slowly 
with distance. Our next challenge is correcting the involved selection effects to study the 
true density law of the halo. 

The distribution of stars in Fig. I1.6l can be collapsed on one axis at a time, as shown in 
Fig. 11.81 In the upper panel, the clump of stars that we identify with the thick disk becomes 
even more obvious. The distribution of brightness for the EDR stars peaks at V ~ 18 mag, 
which corresponds to distances of ^ lO^ "* pc for giants and ~ 10^^ pc for supergiants. The 
continuous shape of the number density of stars between 10^ and 10^'^ pc strongly suggests 
that the observed density decline is mainly driven by the reality in the Galactic stellar halo. 

K, G, and F-type dwarfs in the EDR allow us to cover the range ^ 10^-10^ pc, and 
therefore they essentially are the thick disk population in our sample (see Fig. 1.6). Nearby 
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Fig. 1.8. Histograms of the number of stars analyzed in the EDR sample as a 
function of distance (upper panel) and [Fe/H] (lower). In the lower panel, a two- 
Gaussian model has been fitted (red) curve. The individual components of the 
model are also shown in black. 



giants and supergiants brighter than V '--^ 14 mag are rejected by the selection algorithm to 
avoid saturating the detector. They can only cover distances larger than 10^'^ pc. Together, 
these selection effects provide a simple explanation for the decrease in number density of 
stars at 10^^"^^ pc. In fact, the star counts at 10^^ pc seem to recover the trend apparent at 
distances larger than 10 Kpc. 

The lower panel of Fig. II. 8 I re veals that the metallicity distribution can be approximately 
modeled with only two Gaussian components. A first component, or thick-disk, centered 
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at [Fe/H] = -0.9 (cr = 0.4) dex, and a second-component, or stellar halo, centered at [Fe/H] 
= -2.1 (ct = 0.5) dex. With the selected bin size (0.02 dex), the artificial overdensities at the 
grid nodes are easy to spot and have been excluded from this figure. 




Fig. 1.9. In an attempt to estimate the abundance ratio between the a elements 
(Mg and Ca) and Fe, we vary the weights in our fitting procedure to favor some 
lines over others as metallicity indicators. When all lines are considered we derive 
[A/H], and when bias the analysis for, and against, lines of Mg and Ca, we obtain 
[a/H] and [Fe/H], respectively. The red line corresponds to the assumption adopted 
for calculating the synthetic spectra, as given in Eq. 11.11 

The weights W, in Eq. II. 21 are about 500 times larger for any of the SDSS photometric 
indices than for any given pixel in the spectra. The weights for the spectra are heavily biased 
towards lines, which carry most of the information on the parameters we are interested in. 
The relevant lines in the optical spectrum of a metal-poor star, at the resolution and S/N we 
are dealing with, are only a few: Balmer lines, Ca 11 H and K, the Ca 11 IR triplet, the Na 
D lines, the Mg 1 b triplet, and a number of strong lines of the iron-peak elements (mainly 
Fe 1). By adjusting the weights, one has the capability of using some groups of lines and 
disregarding others, biasing the results. 

In Fig. 11.91 we have renamed the metallicities derived in the standard case of using all 
possible lines (previously referred to as [Fe/H]) as [A/H]. The results of two new runs where 
the lines of Ca and Mg were given enhanced weights, while lines of the iron-peak elements 
were disregarded, and viceversa, are labeled as [a/H] and [Fe/H], respectively. For the 
most metal-poor stars, in particular for the warmer stars, most metal lines are too weak for 
detection in the EDR spectra, and Ca II K remains as the only reliable metallicity indicator 
In that regime, we expect [a/Fe] to increase quickly, even exceeding unity, and [a /A] — > 1 . 
This may be happening for the lowest values of [Fe/H] or [A/H] in Fig. 11.91 However, both 
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[a/Fe] and [a/A] may still be reliable at [Fe/H] ~ [A/H] ~ -2, where a clear discrepancy is 
shown between measurements and our assumptions about the enhancement of a elements in 
Eq. 1 1.1 1 (shown as the red curve in Fig. II. 9> . The agreement is, however, fair in the interval 
-1.5 < [Fe/H] ~ [A/H] <0. 

1.5 Conclusions 

The spectra of Galactic stars acquired in the course of the Sloan Digital Sky Sur- 
vey (SDSS), although collected mainly for calibration purposes, constitute an unprecedented 
database to study some of the oldest stellar populations in the Milky Way. The stellar atmo- 
spheric parameters and the interstellar reddening are directly disentangled with reasonable 
accuracy from the spectra and broad-band photometry through standard spectral analysis 
techniques. Chemical abundances for selected elements can also be extracted. Given the 
volume of data - SDSS will probably obtain spectra for ^ 10^ stars - the analysis requires 
automated procedures that we implement through a pre-calculated grid of model fluxes cou- 
pled to an optimization algorithm. Stellar evolution theory allows us to constrain interesting 
stellar parameters such as masses, radii, and ages, as well as to estimate distances, once the 
atmospheric parameters have been derived. 

A preliminary analysis of nearly 5000 spectra in the Early Data Release (EDR) shows 
most SDSS dwarf stars belong to the thick disk of the Milky Way. This population is con- 
sistent with a scale height of 1 - 2 Kpc, in agreement with previous results. Giants and 
supergiants trace the Galactic halo up to 200 Kpc from the plane of the disk. The separation 
between thick disk and halo is evident in the range of distances they occupy, and their chem- 
ical abundances. We also expect these two populations to be distinct in soon-to-be explored 
ages and kinematics. 
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